Pesquisa | Portal Regional da BVS

1.

Predicting variable gene content in Escherichia coli using conserved genes.

Nguyen, Marcus; Elmore, Zachary; Ihle, Clay; Moen, Francesco S; Slater, Adam D; Turner, Benjamin N; Parrello, Bruce; Best, Aaron A; Davis, James J.

mSystems ; 8(4): e0005823, 2023 08 31.

Artigo em Inglês | MEDLINE | ID: mdl-37314210

RESUMO

Having the ability to predict the protein-encoding gene content of an incomplete genome or metagenome-assembled genome is important for a variety of bioinformatic tasks. In this study, as a proof of concept, we built machine learning classifiers for predicting variable gene content in Escherichia coli genomes using only the nucleotide k-mers from a set of 100 conserved genes as features. Protein families were used to define orthologs, and a single classifier was built for predicting the presence or absence of each protein family occurring in 10%-90% of all E. coli genomes. The resulting set of 3,259 extreme gradient boosting classifiers had a per-genome average macro F1 score of 0.944 [0.943-0.945, 95% CI]. We show that the F1 scores are stable across multi-locus sequence types and that the trend can be recapitulated by sampling a smaller number of core genes or diverse input genomes. Surprisingly, the presence or absence of poorly annotated proteins, including "hypothetical proteins" was accurately predicted (F1 = 0.902 [0.898-0.906, 95% CI]). Models for proteins with horizontal gene transfer-related functions had slightly lower F1 scores but were still accurate (F1s = 0.895, 0.872, 0.824, and 0.841 for transposon, phage, plasmid, and antimicrobial resistance-related functions, respectively). Finally, using a holdout set of 419 diverse E. coli genomes that were isolated from freshwater environmental sources, we observed an average per-genome F1 score of 0.880 [0.876-0.883, 95% CI], demonstrating the extensibility of the models. Overall, this study provides a framework for predicting variable gene content using a limited amount of input sequence data. IMPORTANCE Having the ability to predict the protein-encoding gene content of a genome is important for assessing genome quality, binning genomes from shotgun metagenomic assemblies, and assessing risk due to the presence of antimicrobial resistance and other virulence genes. In this study, we built a set of binary classifiers for predicting the presence or absence of variable genes occurring in 10%-90% of all publicly available E. coli genomes. Overall, the results show that a large portion of the E. coli variable gene content can be predicted with high accuracy, including genes with functions relating to horizontal gene transfer. This study offers a strategy for predicting gene content using limited input sequence data.

Assuntos

Anti-Infecciosos , Proteínas de Escherichia coli , Escherichia coli/genética , Genoma Bacteriano/genética , Plasmídeos , Proteínas de Escherichia coli/genética

2.

Providing a Safe, In-Person, Residential College Experience During the COVID-19 Pandemic.

Travis, Scott A; Best, Aaron A; Bochniak, Kristyn S; Dunteman, Nicole D; Fellinger, Jennifer; Folkert, Peter D; Koberna, Timothy; Kopek, Benjamin G; Krueger, Brent P; Pestun, Jeff; Pikaart, Michael J; Sabo, Cindy; Schuitema, Alex J.

Front Public Health ; 9: 672344, 2021.

Artigo em Inglês | MEDLINE | ID: mdl-34249839

RESUMO

Due to the COVID-19 pandemic, higher education institutions were forced to make difficult decisions regarding the 2020-2021 academic year. Many institutions decided to have courses in an online remote format, others decided to attempt an in-person experience, while still others took a hybrid approach. Hope College (Holland, MI) decided that an in-person semester would be safer and more equitable for students. To achieve this at a residential college required broad collaboration across multiple stakeholders. Here, we share lessons learned and detail Hope College's model, including wastewater surveillance, comprehensive testing, contact tracing, and isolation procedures that allowed us to deliver on our commitment of an in-person, residential college experience.

Assuntos

COVID-19 , Educação a Distância , Pandemias , Humanos , Pandemias/prevenção & controle , SARS-CoV-2 , Universidades

3.

A global reconnaissance of particulates and metals/metalloids in untreated drinking water sources.

Peterson, Jonathan W; Fry, Benjamin M; Wade, Daniel R; Fishman, Ford J; Stid, Jacob T; Peterson, Jonas M; Tarp, Cleveland E; Wade, Randall D; Brokus, Sarah A; Pikaart, Michael J; Krueger, Brent P; Best, Aaron A.

Environ Monit Assess ; 193(5): 307, 2021 Apr 28.

Artigo em Inglês | MEDLINE | ID: mdl-33909163

RESUMO

Metal and metalloid contamination in drinking water sources is a global concern, particularly in developing countries. This study used hollow membrane water filters and metal-capturing polyurethane foams to sample 71 drinking water sources in 22 different countries. Field sampling was performed with sampling kits prepared in the lab at Hope College in Holland, MI, USA. Filters and foams were sent back to the lab after sampling, and subsequent analysis of flushates and rinsates allowed the estimation of suspended solids and metal and other analayte concentrations in source waters. Estimated particulate concentrations were 0-92 mg/L, and consisted of quartz, feldspar, and clay, with some samples containing metal oxides or sulfide phases. As and Cu were the only analytes which occurred above the World Health Organization (WHO) guidelines of 10 µg/L and 2000 µg/L, respectively, with As exceeding the guideline in 45% of the sources and Cu in 3%. Except for one value of ~ 285 µg/L, As concentrations were 45-200 µg/L (river), 65-179 µg/L (well), and 112-178 µg/L (tap). Other metals (Ce, Fe, Mg, Mn, Zn) with no WHO guideline were also detected, with Mn the most common. This study demonstrated that filters and foams can be used for reconnaissance characterization of untreated drinking water. However, estimated metal and other analyte concentrations could only be reported as minimum values due to potential incomplete retrieval of foam-bound analytes. A qualitative reporting methodology was used to report analytes as "present" if the concentration was below the WHO guideline, and "present-recommend retesting" if the concentration was quantifiable and above the WHO guideline.

Assuntos

Água Potável , Metaloides , Metais Pesados , Poluentes Químicos da Água , Monitoramento Ambiental , Humanos , Metaloides/análise , Metais Pesados/análise , Países Baixos , Poluentes Químicos da Água/análise

4.

Diarrhea prevalence in a randomized, controlled prospective trial of point-of-use water filters in homes and schools in the Dominican Republic.

Tintle, Nathan; Van De Griend, Kristin; Ulrich, Rachel; Wade, Randall D; Baar, Tena M; Boven, Emma; Cooper, Carolyn E A; Couch, Olivia; Eekhoff, Lauren; Fry, Benjamin; Goszkowicz, Grace K; Hecksel, Maya A; Heynen, Adam; Laughlin, Jade A; Les, Sydney M; Lombard, Taylor R; Munson, B Daniel; Peterson, Jonas M; Schumann, Eric; Settecerri, Daniel J; Spry, Jacob E; Summerfield, Matthew J; Sunder, Meghana; Wade, Daniel R; Zonnefeld, Caden G; Brokus, Sarah A; Moen, Francesco S; Slater, Adam D; Peterson, Jonathan W; Pikaart, Michael J; Krueger, Brent P; Best, Aaron A.

Trop Med Health ; 49(1): 1, 2021 Jan 04.

Artigo em Inglês | MEDLINE | ID: mdl-33397511

RESUMO

BACKGROUND: Lack of sustainable access to clean drinking water continues to be an issue of paramount global importance, leading to millions of preventable deaths annually. Best practices for providing sustainable access to clean drinking water, however, remain unclear. Widespread installation of low-cost, in-home, point of use water filtration systems is a promising strategy. METHODS: We conducted a prospective, randomized, controlled trial whereby 16 villages were selected and randomly assigned to one of four treatment arms based on the installation location of Sawyer® PointONE™ filters (filter in both home and school; filter in home only; filter in school only; control group). Water samples and self-reported information on diarrhea were collected at multiple times throughout the study. RESULTS: Self-reported household prevalence of diarrhea decreased from 25.6 to 9.76% from installation to follow-up (at least 7 days, and up to 200 days post-filter installation). These declines were also observed in diarrhea with economic or educational consequences (diarrhea which led to medical treatment and/or missing school or work) with baseline prevalence of 9.64% declining to 1.57%. Decreases in diarrhea prevalence were observed across age groups. There was no evidence of a loss of efficacy of filters up to 200 days post-filter installation. Installation of filters in schools was not associated with decreases in diarrhea prevalence in school-aged children or family members. Unfiltered water samples both at schools and homes contained potential waterborne bacterial pathogens, dissolved heavy metals and metals associated with particulates. All dissolved metals were detected at levels below World Health Organization action guidelines. CONCLUSIONS: This controlled trial provides strong evidence of the effectiveness of point-of-use, hollow fiber membrane filters at reducing diarrhea from bacterial sources up to 200 days post-installation when installed in homes. No statistically significant reduction in diarrhea was found when filters were installed in schools. Further research is needed in order to explore filter efficacy and utilization after 200 days post-installation. TRIAL REGISTRATION: ClinicalTrials.gov, NCT03972618 . Registered 3 June 2019-retrospectively registered.

5.

The ModelSEED Biochemistry Database for the integration of metabolic annotations and the reconstruction, comparison and analysis of metabolic models for plants, fungi and microbes.

Seaver, Samuel M D; Liu, Filipe; Zhang, Qizhi; Jeffryes, James; Faria, José P; Edirisinghe, Janaka N; Mundy, Michael; Chia, Nicholas; Noor, Elad; Beber, Moritz E; Best, Aaron A; DeJongh, Matthew; Kimbrel, Jeffrey A; D'haeseleer, Patrik; McCorkle, Sean R; Bolton, Jay R; Pearson, Erik; Canon, Shane; Wood-Charlson, Elisha M; Cottingham, Robert W; Arkin, Adam P; Henry, Christopher S.

Nucleic Acids Res ; 49(D1): D1555, 2021 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-33179751

6.

The ModelSEED Biochemistry Database for the integration of metabolic annotations and the reconstruction, comparison and analysis of metabolic models for plants, fungi and microbes.

Seaver, Samuel M D; Liu, Filipe; Zhang, Qizhi; Jeffryes, James; Faria, José P; Edirisinghe, Janaka N; Mundy, Michael; Chia, Nicholas; Noor, Elad; Beber, Moritz E; Best, Aaron A; DeJongh, Matthew; Kimbrel, Jeffrey A; D'haeseleer, Patrik; McCorkle, Sean R; Bolton, Jay R; Pearson, Erik; Canon, Shane; Wood-Charlson, Elisha M; Cottingham, Robert W; Arkin, Adam P; Henry, Christopher S.

Nucleic Acids Res ; 49(D1): D575-D588, 2021 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-32986834

RESUMO

For over 10 years, ModelSEED has been a primary resource for the construction of draft genome-scale metabolic models based on annotated microbial or plant genomes. Now being released, the biochemistry database serves as the foundation of biochemical data underlying ModelSEED and KBase. The biochemistry database embodies several properties that, taken together, distinguish it from other published biochemistry resources by: (i) including compartmentalization, transport reactions, charged molecules and proton balancing on reactions; (ii) being extensible by the user community, with all data stored in GitHub; and (iii) design as a biochemical 'Rosetta Stone' to facilitate comparison and integration of annotations from many different tools and databases. The database was constructed by combining chemical data from many resources, applying standard transformations, identifying redundancies and computing thermodynamic properties. The ModelSEED biochemistry is continually tested using flux balance analysis to ensure the biochemical network is modeling-ready and capable of simulating diverse phenotypes. Ontologies can be designed to aid in comparing and reconciling metabolic reconstructions that differ in how they represent various metabolic pathways. ModelSEED now includes 33,978 compounds and 36,645 reactions, available as a set of extensible files on GitHub, and available to search at https://modelseed.org/biochem and KBase.

Assuntos

Bactérias/metabolismo , Bases de Dados Factuais , Fungos/metabolismo , Redes e Vias Metabólicas , Anotação de Sequência Molecular , Plantas/metabolismo , Bactérias/genética , Genoma Bacteriano , Termodinâmica

7.

Evaluating the efficacy of point-of-use water filtration units in Fiji.

Tintle, Nathan; Heynen, Adam; Van De Griend, Kristin; Ulrich, Rachel; Ojo, Matthew; Boven, Emma; Brokus, Sarah; Wade, Randall; Best, Aaron A.

Trop Med Health ; 47: 48, 2019.

Artigo em Inglês | MEDLINE | ID: mdl-31410085

RESUMO

BACKGROUND: To develop and evaluate a strategy for reducing the prevalence and impact of waterborne disease, a water quality intervention was developed for Fiji by Give Clean Water, Inc. in partnership with the Fiji Ministry of Health. Residents were provided and trained on how to use a Sawyer® PointONE™ filter, while also being taught proper handwashing techniques. At the time of the filter installation, all households were surveyed inquiring about the prior 2- to 4-week period. Households were measured a second time between 19 and 225 days later (mean = 66 days). RESULTS: To date, five economic and health outcomes have been tracked on 503 households to evaluate the efficacy of the intervention. When comparing baseline to follow-up among the 503 households, the 2-week diarrhea prevalence decreased in households from 17.5% at baseline to 1.8% at follow-up. Also, the 2-week prevalence of severe diarrhea decreased per household from 9.7% at baseline to 0.6% at follow-up. Finally, monthly diarrhea-related medical costs reduced by an average of Fijian (FJ) $3.54 per person, and monthly water expenses reduced by FJ $0.63 per person. All estimated values are obtained from general linear and logistic mixed-effect models, which adjusted for location, season, time to follow-up, household size, water source, and respondent changing. Changes in economic and health outcomes from installation to follow-up were statistically significant (p < 0.05) in all cases, in both unadjusted and adjusted models. CONCLUSIONS: The installation of water filters shows promise for the reduction of diarrhea prevalence in Fiji, as well as the reduction of diarrhea-related medical costs and water expenses. Future work entails evaluation in other countries and contexts, long-term health monitoring, and comparison to alternative water quality interventions.

8.

KBase: The United States Department of Energy Systems Biology Knowledgebase.

Arkin, Adam P; Cottingham, Robert W; Henry, Christopher S; Harris, Nomi L; Stevens, Rick L; Maslov, Sergei; Dehal, Paramvir; Ware, Doreen; Perez, Fernando; Canon, Shane; Sneddon, Michael W; Henderson, Matthew L; Riehl, William J; Murphy-Olson, Dan; Chan, Stephen Y; Kamimura, Roy T; Kumari, Sunita; Drake, Meghan M; Brettin, Thomas S; Glass, Elizabeth M; Chivian, Dylan; Gunter, Dan; Weston, David J; Allen, Benjamin H; Baumohl, Jason; Best, Aaron A; Bowen, Ben; Brenner, Steven E; Bun, Christopher C; Chandonia, John-Marc; Chia, Jer-Ming; Colasanti, Ric; Conrad, Neal; Davis, James J; Davison, Brian H; DeJongh, Matthew; Devoid, Scott; Dietrich, Emily; Dubchak, Inna; Edirisinghe, Janaka N; Fang, Gang; Faria, José P; Frybarger, Paul M; Gerlach, Wolfgang; Gerstein, Mark; Greiner, Annette; Gurtowski, James; Haun, Holly L; He, Fei; Jain, Rashmi.

Nat Biotechnol ; 36(7): 566-569, 2018 07 06.

Artigo em Inglês | MEDLINE | ID: mdl-29979655

Assuntos

Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados/tendências , Bases de Conhecimento , Biologia de Sistemas/tendências , Humanos , Estados Unidos

9.

Genome Sequences of Mycobacteriophages Amgine, Amohnition, Bella96, Cain, DarthP, Hammy, Krueger, LastHope, Peanam, PhelpsODU, Phrank, SirPhilip, Slimphazie, and Unicorn.

Anders, Kirk R; Barekzi, Nazir; Best, Aaron A; Frederick, Gregory D; Mavrodi, Dmitri V; Vazquez, Edwin; Amoh, Nana Yaa A; Baliraine, Frederick N; Buchser, William J; Cast, Thomas P; Chamberlain, Carmen E; Chung, Hui-Min; D'Angelo, William A; Farris, Christian T; Fernandez-Martinez, Mariceli; Fischman, Haley D; Forsyth, Mark H; Fortier, Anna G; Gallo, Kara F; Held, Greta J; Lomas, Miguel A; Maldonado-Vazquez, Natalia Y; Moonsammy, Claudia H; Namboote, Peace; Paudel, Sudip; Polley, Sarah-Elizabeth M; Reyes, Gabriella M; Rubin, Michael R; Saha, Margaret S; Stukey, Joseph; Tobias, Tristan D; Garlena, Rebecca A; Stoner, Ty H; Cresawn, Steven G; Jacobs-Sera, Deborah; Pope, Welkin H; Russell, Daniel A; Hatfull, Graham F.

Genome Announc ; 5(49)2017 Dec 07.

Artigo em Inglês | MEDLINE | ID: mdl-29217790

RESUMO

We report the genome sequences of 14 cluster K mycobacteriophages isolated using Mycobacterium smegmatis mc²155 as host. Four are closely related to subcluster K1 phages, and 10 are members of subcluster K6. The phage genomes span considerable sequence diversity, including multiple types of integrases and integration sites.

10.

Whole-Genome Shotgun Sequences of Salmonella enterica Serovar Typhimurium Lilleengen Type Strains LT1, LT18, LT19, LT20, LT21, and LT22.

Kazmierczak, Robert A; Best, Aaron A; Nguyen, Duy; Eisenstark, Abraham.

Genome Announc ; 5(30)2017 Jul 27.

Artigo em Inglês | MEDLINE | ID: mdl-28751402

RESUMO

The Lilleengen type (LT) collection of Salmonella enterica serovar Typhimurium strains has served the scientific community as a group of model organisms for basic genetic and biochemical pathway research. Here, we report the whole-genome shotgun sequences of Salmonella enterica serovar Typhimurium strains LT1, LT18, LT19, LT20, LT21, and LT22.

11.

IDENTIFICATION AND ANALYSIS OF BACTERIAL GENOMIC METABOLIC SIGNATURES.

Bowerman, Nathaniel; Tintle, Nathan; Dejongh, Matthew; Best, Aaron A.

Pac Symp Biocomput ; 22: 3-14, 2017.

Artigo em Inglês | MEDLINE | ID: mdl-27896957

RESUMO

With continued rapid growth in the number and quality of fully sequenced and accurately annotated bacterial genomes, we have unprecedented opportunities to understand metabolic diversity. We selected 101 diverse and representative completely sequenced bacteria and implemented a manual curation effort to identify 846 unique metabolic variants present in these bacteria. The presence or absence of these variants act as a metabolic signature for each of the bacteria, which can then be used to understand similarities and differences between and across bacterial groups. We propose a novel and robust method of summarizing metabolic diversity using metabolic signatures and use this method to generate a metabolic tree, clustering metabolically similar organisms. Resulting analysis of the metabolic tree confirms strong associations with well-established biological results along with direct insight into particular metabolic variants which are most predictive of metabolic diversity. The positive results of this manual curation effort and novel method development suggest that future work is needed to further expand the set of bacteria to which this approach is applied and use the resulting tree to test broad questions about metabolic diversity and complexity across the bacterial tree of life.

Assuntos

Bactérias/genética , Bactérias/metabolismo , Genoma Bacteriano , Bactérias/classificação , Biologia Computacional , Variação Genética , Redes e Vias Metabólicas/genética , Fenótipo , Filogenia

12.

Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation.

Faria, José P; Davis, James J; Edirisinghe, Janaka N; Taylor, Ronald C; Weisenhorn, Pamela; Olson, Robert D; Stevens, Rick L; Rocha, Miguel; Rocha, Isabel; Best, Aaron A; DeJongh, Matthew; Tintle, Nathan L; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.

Front Microbiol ; 7: 1819, 2016.

Artigo em Inglês | MEDLINE | ID: mdl-27933038

RESUMO

Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain.

13.

A Bayesian Framework for the Classification of Microbial Gene Activity States.

Disselkoen, Craig; Greco, Brian; Cook, Kaitlyn; Koch, Kristin; Lerebours, Reginald; Viss, Chase; Cape, Joshua; Held, Elizabeth; Ashenafi, Yonatan; Fischer, Karen; Acosta, Allyson; Cunningham, Mark; Best, Aaron A; DeJongh, Matthew; Tintle, Nathan.

Front Microbiol ; 7: 1191, 2016.

Artigo em Inglês | MEDLINE | ID: mdl-27555837

RESUMO

Numerous methods for classifying gene activity states based on gene expression data have been proposed for use in downstream applications, such as incorporating transcriptomics data into metabolic models in order to improve resulting flux predictions. These methods often attempt to classify gene activity for each gene in each experimental condition as belonging to one of two states: active (the gene product is part of an active cellular mechanism) or inactive (the cellular mechanism is not active). These existing methods of classifying gene activity states suffer from multiple limitations, including enforcing unrealistic constraints on the overall proportions of active and inactive genes, failing to leverage a priori knowledge of gene co-regulation, failing to account for differences between genes, and failing to provide statistically meaningful confidence estimates. We propose a flexible Bayesian approach to classifying gene activity states based on a Gaussian mixture model. The model integrates genome-wide transcriptomics data from multiple conditions and information about gene co-regulation to provide activity state confidence estimates for each gene in each condition. We compare the performance of our novel method to existing methods on both simulated data and real data from 907 E. coli gene expression arrays, as well as a comparison with experimentally measured flux values in 29 conditions, demonstrating that our method provides more consistent and accurate results than existing methods across a variety of metrics.

14.

Characterization of Gut Microbiome Dynamics in Developing Pekin Ducks and Impact of Management System.

Best, Aaron A; Porter, Amanda L; Fraley, Susan M; Fraley, Gregory S.

Front Microbiol ; 7: 2125, 2016.

Artigo em Inglês | MEDLINE | ID: mdl-28101086

RESUMO

Little to no research has been conducted on the gut microbiome of the Pekin duck, yet over 24.5 million ducks are raised for human consumption each year in the United States alone. Knowledge of the microbiome could lead to an understanding of the effects of growing conditions such as the use of prebiotics, probiotics, and enzymes in feeding practices, the use of antibiotics, and the sources of pathogenic bacteria in diseased ducks. In order to characterize changes in the caecal microbiome that occur as ducks develop through a typical industry grow-out period, a 16S rRNA community analysis of caecal contents collected over a 6-week period was conducted using a next generation sequencing approach. Transitions in the composition of the caecal microbiome occurred throughout the lifespan, with a large shift during days 4 through 10 posthatch. Two major phyla of bacteria were found to be present within the caeca of aviary raised ducks, with the relative abundance of each phylum varying by age of the duck. Proteobacteria is dominant for the first 3 days of age, and Firmicutes increases and dominates beginning at day 4. Barn raised ducks contained a significant population of Bacteroidetes in addition to Proteobacteria and Firmicutes at later developmental time points, though this phylum was absent in aviary raised ducks. Genera containing pathogens of anseriformes most often found in industry settings were either absent or found as normal parts of the caecal microbial populations. The high level differences in phylum abundance highlight the importance of well-designed sampling strategies for microbiome based studies. Results showed clear distinctions between Pekin Duck caecal contents and those of Broiler Chickens and Turkey in a qualitative comparison. These data provide a reference point for studies of the Pekin Duck through industry grow-out ages, provide a foundation for understanding the types of bacteria that promote health, and may lead to improved methods to increase yields and decrease instances of disease in agricultural production processes.

15.

Cautions about the reliability of pairwise gene correlations based on expression data.

Powers, Scott; DeJongh, Matt; Best, Aaron A; Tintle, Nathan L.

Front Microbiol ; 6: 650, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-26167162

RESUMO

BACKGROUND: Rapid growth in the availability of genome-wide transcript abundance levels through gene expression microarrays and RNAseq promises to provide deep biological insights into the complex, genome-wide transcriptional behavior of single-celled organisms. However, this promise has not yet been fully realized. RESULTS: We find that computation of pairwise gene associations (correlation; mutual information) across a set of 2782 total genome-wide expression samples from six diverse bacteria produces unexpectedly large variation in estimates of pairwise gene association-regardless of the metric used, the organism under study, or the number and source of the samples. We pinpoint the cause to sampling bias. In particular, in repositories of expression data (e.g., Gene Expression Omnibus, GEO), many individual genes show small differences in absolute gene expression levels across the set of samples. We demonstrate that these small differences are due mainly to "noise" instead of "signal" attributable to environmental or genetic perturbations. We show that downstream analysis using gene expression levels of genes with small differences yields biased estimates of pairwise association. CONCLUSIONS: We propose flagging genes with small differences in absolute, RMA-normalized, expression levels (e.g., standard deviation less than 0.5), as potentially yielding biased pairwise association metrics. This strategy has the potential to substantially improve the confidence in genome-wide conclusions about transcriptional behavior in bacterial organisms. Further work is needed to further refine strategies to identify genes with small difference in expression levels prior to computing gene-gene association metrics.

16.

Cluster J mycobacteriophages: intron splicing in capsid and tail genes.

Pope, Welkin H; Jacobs-Sera, Deborah; Best, Aaron A; Broussard, Gregory W; Connerly, Pamela L; Dedrick, Rebekah M; Kremer, Timothy A; Offner, Susan; Ogiefo, Amenawon H; Pizzorno, Marie C; Rockenbach, Kate; Russell, Daniel A; Stowe, Emily L; Stukey, Joseph; Thibault, Sarah A; Conway, James F; Hendrix, Roger W; Hatfull, Graham F.

PLoS One ; 8(7): e69273, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-23874930

RESUMO

Bacteriophages isolated on Mycobacterium smegmatis mc(2)155 represent many distinct genomes sharing little or no DNA sequence similarity. The genomes are architecturally mosaic and are replete with genes of unknown function. A new group of genomes sharing substantial nucleotide sequences constitute Cluster J. The six mycobacteriophages forming Cluster J are morphologically members of the Siphoviridae, but have unusually long genomes ranging from 106.3 to 117 kbp. Reconstruction of the capsid by cryo-electron microscopy of mycobacteriophage BAKA reveals an icosahedral structure with a triangulation number of 13. All six phages are temperate and homoimmune, and prophage establishment involves integration into a tRNA-Leu gene not previously identified as a mycobacterial attB site for phage integration. The Cluster J genomes provide two examples of intron splicing within the virion structural genes, one in a major capsid subunit gene, and one in a tail gene. These genomes also contain numerous free-standing HNH homing endonuclease, and comparative analysis reveals how these could contribute to genome mosaicism. The unusual Cluster J genomes provide new insights into phage genome architecture, gene function, capsid structure, gene mobility, intron splicing, and evolution.

Assuntos

Proteínas do Capsídeo/genética , Micobacteriófagos/classificação , Micobacteriófagos/genética , Proteínas da Cauda Viral/genética , Sequência de Aminoácidos , Bacteriólise/genética , Composição de Bases , Sequência de Bases , Proteínas do Capsídeo/química , Análise por Conglomerados , Elementos de DNA Transponíveis , Ordem dos Genes , Tamanho do Genoma , Genoma Viral , Íntrons , Dados de Sequência Molecular , Micobacteriófagos/ultraestrutura , Fases de Leitura Aberta , Filogenia , Splicing de RNA , Proteínas da Cauda Viral/química , Vírion/ultraestrutura , Integração Viral/genética

17.

Genomic reconstruction of transcriptional regulatory networks in lactic acid bacteria.

Ravcheev, Dmitry A; Best, Aaron A; Sernova, Natalia V; Kazanov, Marat D; Novichkov, Pavel S; Rodionov, Dmitry A.

BMC Genomics ; 14: 94, 2013 Feb 12.

Artigo em Inglês | MEDLINE | ID: mdl-23398941

RESUMO

BACKGROUND: Genome scale annotation of regulatory interactions and reconstruction of regulatory networks are the crucial problems in bacterial genomics. The Lactobacillales order of bacteria collates various microorganisms having a large economic impact, including both human and animal pathogens and strains used in the food industry. Nonetheless, no systematic genome-wide analysis of transcriptional regulation has been previously made for this taxonomic group. RESULTS: A comparative genomics approach was used for reconstruction of transcriptional regulatory networks in 30 selected genomes of lactic acid bacteria. The inferred networks comprise regulons for 102 orthologous transcription factors (TFs), including 47 novel regulons for previously uncharacterized TFs. Numerous differences between regulatory networks of the Streptococcaceae and Lactobacillaceae groups were described on several levels. The two groups are characterized by substantially different sets of TFs encoded in their genomes. Content of the inferred regulons and structure of their cognate TF binding motifs differ for many orthologous TFs between the two groups. Multiple cases of non-orthologous displacements of TFs that control specific metabolic pathways were reported. CONCLUSIONS: The reconstructed regulatory networks substantially expand the existing knowledge of transcriptional regulation in lactic acid bacteria. In each of 30 studied genomes the obtained regulatory network contains on average 36 TFs and 250 target genes that are mostly involved in carbohydrate metabolism, stress response, metal homeostasis and amino acids biosynthesis. The inferred networks can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. All reconstructed regulons are captured within the Streptococcaceae and Lactobacillaceae collections in the RegPrecise database (http://regprecise.lbl.gov).

Assuntos

Redes Reguladoras de Genes , Genoma Bacteriano , Lactobacillales/genética , Streptococcaceae/genética , Aminoácidos/metabolismo , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Metabolismo dos Carboidratos/genética , Hibridização Genômica Comparativa , Lactobacillales/classificação , Metais/metabolismo , Streptococcaceae/classificação , Estresse Fisiológico/genética , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo

18.

Automated genome annotation and metabolic model reconstruction in the SEED and Model SEED.

Devoid, Scott; Overbeek, Ross; DeJongh, Matthew; Vonstein, Veronika; Best, Aaron A; Henry, Christopher.

Methods Mol Biol ; 985: 17-45, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-23417797

RESUMO

Over the past decade, genome-scale metabolic models have proven to be a crucial resource for predicting organism phenotypes from genotypes. These models provide a means of rapidly translating detailed knowledge of thousands of enzymatic processes into quantitative predictions of whole-cell behavior. Until recently, the pace of new metabolic model development was eclipsed by the pace at which new genomes were being sequenced. To address this problem, the RAST and the Model SEED framework were developed as a means of automatically producing annotations and draft genome-scale metabolic models. In this chapter, we describe the automated model reconstruction process in detail, starting from a new genome sequence and finishing on a functioning genome-scale metabolic model. We break down the model reconstruction process into eight steps: submitting a genome sequence to RAST, annotating the genome, curating the annotation, submitting the annotation to Model SEED, reconstructing the core model, generating the draft biomass reaction, auto-completing the model, and curating the model. Each of these eight steps is documented in detail.

Assuntos

Redes e Vias Metabólicas/genética , Modelos Genéticos , Anotação de Sequência Molecular/métodos , Interface Usuário-Computador , Bases de Dados Genéticas , Genoma Bacteriano , Sistemas On-Line , Análise de Sequência de DNA , Software

19.

Comparative genomics and functional analysis of rhamnose catabolic pathways and regulons in bacteria.

Rodionova, Irina A; Li, Xiaoqing; Thiel, Vera; Stolyar, Sergey; Stanton, Krista; Fredrickson, James K; Bryant, Donald A; Osterman, Andrei L; Best, Aaron A; Rodionov, Dmitry A.

Front Microbiol ; 4: 407, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-24391637

RESUMO

L-rhamnose (L-Rha) is a deoxy-hexose sugar commonly found in nature. L-Rha catabolic pathways were previously characterized in various bacteria including Escherichia coli. Nevertheless, homology searches failed to recognize all the genes for the complete L-Rha utilization pathways in diverse microbial species involved in biomass decomposition. Moreover, the regulatory mechanisms of L-Rha catabolism have remained unclear in most species. A comparative genomics approach was used to reconstruct the L-Rha catabolic pathways and transcriptional regulons in the phyla Actinobacteria, Bacteroidetes, Chloroflexi, Firmicutes, Proteobacteria, and Thermotogae. The reconstructed pathways include multiple novel enzymes and transporters involved in the utilization of L-Rha and L-Rha-containing polymers. Large-scale regulon inference using bioinformatics revealed remarkable variations in transcriptional regulators for L-Rha utilization genes among bacteria. A novel bifunctional enzyme, L-rhamnulose-phosphate aldolase (RhaE) fused to L-lactaldehyde dehydrogenase (RhaW), which is not homologous to previously characterized L-Rha catabolic enzymes, was identified in diverse bacteria including Chloroflexi, Bacilli, and Alphaproteobacteria. By using in vitro biochemical assays we validated both enzymatic activities of the purified recombinant RhaEW proteins from Chloroflexus aurantiacus and Bacillus subtilis. Another novel enzyme of the L-Rha catabolism, L-lactaldehyde reductase (RhaZ), was identified in Gammaproteobacteria and experimentally validated by in vitro enzymatic assays using the recombinant protein from Salmonella typhimurium. C. aurantiacus induced transcription of the predicted L-Rha utilization genes when L-Rha was present in the growth medium and consumed L-Rha from the medium. This study provided comprehensive insights to L-Rha catabolism and its regulation in diverse Bacteria.

20.

Diversity and versatility of the Thermotoga maritima sugar kinome.

Rodionova, Irina A; Yang, Chen; Li, Xiaoqing; Kurnasov, Oleg V; Best, Aaron A; Osterman, Andrei L; Rodionov, Dmitry A.

J Bacteriol ; 194(20): 5552-63, 2012 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-22885293

RESUMO

Sugar phosphorylation is an indispensable committed step in a large variety of sugar catabolic pathways, which are major suppliers of carbon and energy in heterotrophic species. Specialized sugar kinases that are indispensable for most of these pathways can be utilized as signature enzymes for the reconstruction of carbohydrate utilization machinery from microbial genomic and metagenomic data. Sugar kinases occur in several structurally distinct families with various partially overlapping as well as yet unknown substrate specificities that often cannot be accurately assigned by homology-based techniques. A subsystems-based metabolic reconstruction combined with the analysis of genome context and followed by experimental testing of predicted gene functions is a powerful approach of functional gene annotation. Here we applied this integrated approach for functional mapping of all sugar kinases constituting an extensive and diverse sugar kinome in the thermophilic bacterium Thermotoga maritima. Substrate preferences of 14 kinases mainly from the FGGY and PfkB families were inferred by bioinformatics analysis and biochemically characterized by screening with a panel of 45 different carbohydrates. Most of the analyzed enzymes displayed narrow substrate preferences corresponding to their predicted physiological roles in their respective catabolic pathways. The observed consistency supports the choice of kinases as signature enzymes for genomics-based identification and reconstruction of sugar utilization pathways. Use of the integrated genomic and experimental approach greatly speeds up the identification of the biochemical function of unknown proteins and improves the quality of reconstructed pathways.

Assuntos

Metabolismo dos Carboidratos , Fosfotransferases/genética , Fosfotransferases/metabolismo , Thermotoga maritima/enzimologia , Thermotoga maritima/genética , Biologia Computacional , Genoma , Fosforilação , Proteoma , Especificidade por Substrato

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA